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(54) System and method for ensuring the integrity of stored data 



(57) A system and method for determining when 
data stored in a data storage device is becoming unreli- 
able. A list of trigger events 1 02 is maintained, either on 
the media, or on a host processor, and the number of 
trigger events 101 pertaining to data retrieval errors is 
also maintained either on the disc media or on the host 
processor. Based upon the trigger events and the run- 

FIG. 1 



ning totals, the disc media is scanned for errors from 
time to time and the error detecting algorithm can 
change depending upon the respective trigger totals. 
The triggers are typically representative of events likely 
to cause errors to occur and can be tailored to different 
types of storage media. 
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Description 

TECHNICAL FIELD OF THE INVENTION 

[0001] This invention relates to systems and meth- 
ods for data storage protection and more specifically a 
system and method for monitoring data at various times 
under control of certain trigger events. 

BACKGROUND OF THE INVENTION 

[0002] Data storage devices have methods to 
ensure the integrity of the data. In one method, called 
"read-after-write," data is put onto the media, and is 
then immediately read to ensure that the data was cor- 
rectly written. There is also a process that separately 
and non-automatically looks at the media and checks to 
see that it can be read back from the device. This disc 
verification is a utility (called ScanDisk) that can be used 
on current PCs. 

[0003] The mechanism of read-after-write is disad- 
vantageous in a slow device in that it takes a significant 
amount of extra time to run that process. For example, a 
CD has a much slower read and write speed than does 
a hard drive. The data rates for a CD for writing are 
something today between 300 and 600 Kbytes per sec- 
ond, and a hard drive is on the order of ten times faster 
than that. The reading speed of a CD is about six times 
faster than the writing speed, and a hard drive reading 
speed is the same as the writing speed. Thus, if the user 
needs to conduct an operation on a CD where data is 
written and read back, it will be significantly slower than 
if the user were able to simply write the data to the disc 
without verifying it. This is how the file system for CD 
currently works. Data is written but not verified. 
[0004] When the user does a format operation to 
prepare a CD disc for taking data, the system typically 
writes to that disc to arrange where the data will ulti- 
mately be put and then verifies that the data is able to 
be read back. But after the data has been written, the 
user has no assurance that something hasn't happened 
to damage the disc between the time of format and the 
time of writing data. Any rewritable particular media will 
be used over a long period of time, and many things can 
happen to that disc. The disc can be scratched, get fin- 
gerprints and dirt on it, etc. - All of these factors will 
affect both the successful writing of additional data as 
well as reading of the data. 

[0005] On a hard disk, the media is in a very con- 
trolled, sealed environment. You cannot put a fingerprint 
on a hard disk You cannot get dirt on a hard disk. But a 
removable media is exposed to the environment, 
thereby making it more vulnerable to damage. CD 
media is particularly susceptible to this damage 
because it is the only rewritable media that does not 
have a protective case (i.e. audio tapes, VCR tapes, 
f loppy disks, etc.) When a user does the formatting and 
verifying, the disc is physically secure since it is in a 



device away from fingers, etc. But when the CD is 
removed from the drive problems arise. This is a signifi- 
cant difference between hard disk and CD technology. 
[0006] All data storage devices take advantage of 

5 error correction designed in the format There are many 
error correction algorithms. They are all effective in cor- 
recting errors and defects, but every method has limita- 
tions. A thick thumb print on a CD disc, for instance, will 
render sections of the CD disc unreadable. Since this 

10 type of reusable media is in a very unprotected environ- 
ment, it will be handled and is likely to become soiled 
and damaged at random times over a long period of 
time. Therefore, a method is needed to ensure the 
steadfastness of data integrity. 

15 

S UMMARY OF THE INVENTION 

[0007] These and other objects, features and tech- 
nical advantages are achieved by an improved system 
20 and method which automatically performs disc verifica- 
tion on the disc without any user intervention. It would 
run at appropriate intervals, determined by looking at 
certain parameters of the disc, in order to insure a min- 
imum risk of either data unreadability or data being writ- 
es ten on a location unsuccessfully. The improved system 
includes the error correction processes that currently 
apply to CD's, but the algorithms will look at the disc and 
the usage of the disc, in particular, how much the disc 
has been handled by a user, ft will then determine the 
30 appropriate time to automatically run the disc verifica- 
tion function and repair of the disc. 
[0008] One of the features of the error correction on 
a CD as compared to a hard disk, is that a hard disk will 
only tell the user whether the data is good or bad. Either 
35 the ECC worked or it did not work to correct the data. 
CD technology typically set a threshold on how much 
ECC is used before informing the system to relocate the 
data because of risk of lost data. CD technology also 
typically allows for the system to inquire how much ECC 
40 was used to read the data. The improved system uses 
this method of inquiring how much ECC was used to 
intelligently determine if disc verification is required. 
[0009] Since the process of assessing, cleaning up 
the disc, and relocating data is done in the background, 
45 this system achieves additional protection without hav- 
ing to do a read back after the write process. Conse- 
quently, the improved system gains the performance of 
a write-rt-and-go-on device, while preserving the protec- 
tion of being able to check the disc at regular and appro- 
so priate intervals. 

[0010] The foregoing has outlined rather broadly 
the features and technical advantages of the present 
invention in order that the detailed description of the 
invention that follows may be better understood. Addi- 
55 tional features and advantages of the invention will be 
described hereinafter which form the subject of the 
claims of the invention. It should be appreciated by 
those skilled in the art that the conception and the spe- 
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crfic embodiment disclosed may be readily utilized as a 
basis for modifying or designing other structures for car- 
rying out the same purposes of the present invention. It 
should also be realized by those skilled in the art that 
such equivalent constructions do not depart from the 5 
spirit and scope of the invention as set forth in the 
appended claims. 



BRIEF DESCRIPTION OF THE DRAWINGS 

[0011] For a more complete understanding of the 
present invention, and the advantages thereof, refer- 
ence is now made to the following descriptions taken in 
conjunction with the accompanying drawings, in which: 
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FIGURE 1 is a diagram of the operation of the trig- 
ger event recording; and 

FIGURE 2 is a diagram of the operation of error 
checking as a function of set trigger limits being 
exceeded. 20 

DETAILED DESCRIPTION OF THE INVENTION 

[0012] Turning now to FIGURE 1, box 101 which 
advantageously runs on a host processor, such as a 25 
PC, or other computing device, monitors the trigger 
events which occur. The definitions of the trigger events 
to be monitored are stored on the portable storage 
medium, (or on the host computer or on a network). Box 
102 contains and controls the triggers, which by way of 30 
example could be number of media inserts; number of 
hours of use; number of files that have changed; 
number of errors or type of errors per hour of use; 
change in the sparing table; or the differences in the 
files date and time stamps. 35 
[0013] One trigger could be a specified number of 
times that the media has been inserted into the drive 
unit of the host processor (not shown). The number of 
insertions is an indication that someone had to handle it 
at least x number of times because every time it is 40 
inserted it is by a handling process. You dont really 
know how many other times it has been handled when 
it is outside the drive so this is a little bit indirect. Each 
time the disc is inserted in the improved system a coun- 
ter on the disc would be incremented. When it reached 4s 
a predetermined value the disc verification process 
would begin. When completed the counter would be 
cleared. If data was recovered but took a predetermine 
amount of ECC the data would be automatically move to 
a good area on the disc. so 
[0014] A second trigger could be the number of 
hours of use of the storage media. 
[0015] The number of hours of use is a further 
abstraction. The medium could have been sitting inside 
the drive a long time but the assumption is that after the ss 
disc is used so many times it would have been swapped 
out with other discs. So it is getting at how long the disc 
has been used and assuming that if it has been used a 



number of periods of time it would have been removed 
several times. The system could actually record use on 
the medium, or on the host. The length of time the disc 
is in use could be stored on the disc in the improved 
system. This location would be updated each time the 
disc was in use. When it reached a predetermined value 
the disc verification process would begin. When com- 
pleted this location would be clearer. If data was recov- 
ered but took a predetermined amount of ECC the data 
would be automatically move to a good area on the disc. 
[0016] A third trigger could be the number of fifes 
that have changed. The improved system would watch 
the number of files that have been written and how 
many files have been removed. The number of files that 
have changed (added, modified or deleted) could be 
stored on the disc in the improved system. This location 
would be updated each time files changed. When it 
reached a predetermined value the disc verification 
process would begin. When completed this location 
would be cleared. If data was recovered but took a pre- 
determine amount of ECC the data would be automati- 
cally move to a good area on the disc. 
[0017] A fourth trigger is the number and type of 
errors received. By reading the data the system gener- 
ates information about how much of the ECC has been 
involved. That information can be requested along with 
the data and so the system can set its threshold based 
on that. In the current CDs there are three levels of error 
correction. Call them C1 , C2 and C3. Basically, C1 and 
C2 are combined and you get information about how 
much of C1 and C2 were required to recover the data if 
there was an error. And then you also get that same 
information on C3. If C1 and C2 could not correct it, it 
will go to C3 to correct it and the information as to what 
level of C3 error correction was required to recover the 
data is available. 

[0018] The point is even when you are successful 
reading the data, you can get some information about 
how difficult it was and the improved system would uti- 
lize the information that is available to make some addi- 
tional decisions about whether it is appropriate to look 
for errors on the entire disc. 

[001 9] If the improved system becomes aware that 
there are a high number of errors occurring during read- 
ing, this is an indication that the improved system 
should look at the entire disc rather than just what the 
user was reading right then. 

[0020] The improved system could be set up to 
read just the portions that have data or the entire disc. 
This is important since it increases the likelihood of 
errors at any location. Running the disc verification rou- 
tine will allow for the detection of trouble conditions 
before a user ever tries to write to that area. 
[0021 ] if the improved system determines that there 
are significant portions of bad areas on the disc, it could 
alert the user that it is time to copy to another disc. 
Another feature of the improved system is that it will 
detect contamination and could inform the user to clean 
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the disc before trouble occurs. 

[0022] The fifth trigger is a change in the sparing 
table. The sparing table is where the storage device 
records the places that cannot be written to. The system 
watches that table and as more locations are added to s 
that table the system knows that the disc is wearing out 
or that there is additional dirt on the disc. The file sys- 
tem writes that table. So the computer and device as it 
exists today manage the media and as they are unsuc- 
cessful in reading something back or get high error 10 
rates they will be relocating data, so our method 
watches that sparing table and determines when its 
time to look again at the whole disc. 
[0023] A fifth trigger could be to determine the dif- 
ference in data and time between the newest file and is 
the oldest file. This time delta would be saved on the 
disc. When the delta reaches a predetermined value the 
disc verification process would begin. When completed 
this value would be marked as verified. A new time delta 
would be started from the time delta marked as verified. 20 
When this new delta reached a predetermined value the 
disc verification process would begin. This new time 
delta would then be stored where the old time delta was 
and would be marked as verified. If data was recovered 
but took a predetermine amount of ECC the data would 25 
be automatically move to a good area on the disc. 
[0024] This method is similar to trigger numbers two 
and three. It has the advantage that continuous monitor- 
ing of the files on the disc is not required. It has the dis- 
advantage that only one file of significant time delay 30 
could trigger the disc verification process. 
[0025] Box 1 03 checks for trigger events and simply 
logs the various events into table 104. Some trigger 
events, however, need not be captured in a table since 
they are in tables already and that table acts as a trigger 35 
such that when that table gets too full (for example, the 
error table) the system reacts. 

[0026] FIGURE 2 shows the control of the algorithm 
that determines when to initiate a disc verification proc- 
ess automatically. The user will not have to invoke this 40 
procedure. Most users do not have the kind of knowl- 
edge to know that they should run a disc verification and 
thus a key point is to put the intelligence into the system 
to determine when a disc verification should be run. 
Even a very experienced user would not necessarily 45 
know when this should be done, because if the user had 
a CD for some time and then handed it to a friend, that 
friend would not know the state of that disc, nor how 
much it had been handled, nor how long it had been 
since it had been cleaned up, etc. Additionally, the so 
friend's device may perform differently with the disc. It 
may read with more or less errors do to better or worse 
optics. Using this improved system, a portable memory 
can have multiple users and multiple owners and still be 
cared for properly. ss 
[0027] Box 201 compares the trigger events table 
against each individual trigger event set limit box 202 to 
determine if the set limit has been reached. If it has 



been reached diagnosis are run, box 205, which diag- 
nosing could be a disc verification routine, or could be 
particular to a given type of trigger, if desired. Note also 
that the trigger limits can be adjusted, as desired, either 
by a user, or by the system or from information obtained 
externally, such as from a networkAweb. 
[0028] Box 206 reads the medium and box 207 
detects and stores errors, or invokes the error table and 
blocks certain portions of the medium from further use 
in cooperation with box 208. In some situations, the trig- 
ger event table is updated. Box 205 controls whatever 
action, such as notifying a user, changing speeds of the 
system, etc., as appropriate, given the error condition. 
[0029] There is actually a whole series of protocols 
that can be used around this concept. One is that noth- 
ing is done until the system reaches a certain threshold, 
for example the error table is 50% full. There could be a 
nonlinear progression so that as it gets more full disc 
verification is run more often. Another could be the rate 
of change you are using the disc and nothing is happen- 
ing in that table. All of a sudden the system starts to see 
the table fill up rapidly. The improved system could 
implement disc verification more often as the rate of that 
table fills up. 

[0030] The trigger event list as well as the trigger 
event occurrence list can be stored on the CD disc, on a 
public medium or on a host processor. The host proces- 
sor could transfer one or both of these lists to the porta- 
ble device at certain times, periodically, or upon certain 
events, such as the removal of the device from that host, 
or upon a command from the user. 
[0031] The improved system described is primarily 
designed for CD's running in conjunction with a PC or 
host processor. However, any system using a portable 
data storage medium can take advantage of the con- 
cepts taught herein. For example, DVD's and other stor- 
age devices can be scanned to know when their useful 
life is compromised, or to know when to move data to an 
alternate storage location before that data is lost for- 
ever. In some situations, even the hard drive of the proc- 
essor can be mainframed in this fashion. 
[0032] Although the present invention and its 
advantages have been described in detail, it should be 
understood that various changes, substitutions and 
alterations can be made herein without departing from 
the spirit and scope of the invention as defined by the 
appended claims. 

Claims 

1. A method of improving the long term data integrity 
of a removable storage medium by automatically 
detecting when data on a said storage media has 
reached an unreliable condition, said method com- 
prising the steps of: 

maintaining a running total of certain prede- 
fined trigger events 101 ; and 
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checking 103 said portable media data errors 
in accordance with an established list 102 of 
said trigger events. 

2. The method of claim 1 wherein said established list 5 
of trigger events 102 is common to a number of said 
devices and wherein said running total 101 is 
unique to said media. 

3. The method of claim 1 wherein said running total 10 

101 is maintained on said media. 

4. The method of claim 1 wherein said established list 
of trigger events 102 is maintained on said media. 

75 

5. The method of claim 1 wherein said trigger events 

102 is maintained on a host device to which said 
portable device is temporarily connected. 

6. The method of claim 1 wherein said running totals 20 

101 are maintained on a host device to which said 
portable media is temporarily connected. 

7. The method of claim 6 wherein said host device 
maintained totals 101 are transferred to said porta- 25 
ble media from time to time. 

8. The method of claim 1 wherein said checking step 
includes the step of: tailoring said checking fre- 
quency to the past history of said media. 30 

9. The method of claim 1 wherein said checking step 
includes the steps of: determining which type of 
check 103 should be performed at any given time; 
and tailoring the type of check 103 to the past his- 35 
tory of said media. 

10. The method of claim 1 wherein said trigger events 

102 are selected from the list consisting of: 

40 

a) number of media inserts; 

b) number of hours of use; 

c) number of files that have changed; 

d) number of errors per measure period; 

e) certain changes of the sparing table. 45 
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